Dataset info
| Number of variables | 16 |
|---|---|
| Number of observations | 421570 |
| Missing cells | 1422431 (21.1%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 51.9 MiB |
| Average record size in memory | 129.0 B |
Variables types
| Numeric | 13 |
|---|---|
| Categorical | 2 |
| Boolean | 1 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 0 |
| Rejected | 0 |
| Unsupported | 0 |
Warnings
Date only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Date has a high cardinality: 143 distinct values | Warning |
MarkDown1 has 270889 (64.3%) missing values | Missing |
MarkDown2 has 310322 (73.6%) missing values | Missing |
MarkDown3 has 284479 (67.5%) missing values | Missing |
MarkDown4 has 286603 (68.0%) missing values | Missing |
MarkDown5 has 270138 (64.1%) missing values | Missing |
CPI
Numeric
| Distinct count | 2145 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 171.2019468 |
|---|---|
| Minimum | 126.064 |
| Maximum | 227.2328068 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 126.064 |
|---|---|
| 5-th percentile | 126.4962581 |
| Q1 | 132.0226667 |
| Median | 182.3187801 |
| Q3 | 212.4169928 |
| 95-th percentile | 221.9415576 |
| Maximum | 227.2328068 |
| Range | 101.1688068 |
| Interquartile range | 80.3943261 |
Descriptive statistics
| Standard deviation | 39.15927562 |
|---|---|
| Coef of variation | 0.2287314855 |
| Kurtosis | -1.829714364 |
| Mean | 171.2019468 |
| MAD | 38.06622321 |
| Skewness | 0.08521928473 |
| Sum | 72173604.72 |
| Variance | 1533.448867 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 129.8555333 | 711 | 0.2% | |
| 131.1083333 | 708 | 0.2% | |
| 129.8459667 | 707 | 0.2% | |
| 130.3849032 | 706 | 0.2% | |
| 130.683 | 706 | 0.2% | |
| 131.0756667 | 706 | 0.2% | |
| 130.6457931 | 706 | 0.2% | |
| 130.7196333 | 705 | 0.2% | |
| 130.4546207 | 705 | 0.2% | |
| 129.9845484 | 704 | 0.2% | |
| Other values (2135) | 414506 | 98.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 126.064 | 678 | 0.2% | |
| 126.0766452 | 679 | 0.2% | |
| 126.0854516 | 675 | 0.2% | |
| 126.0892903 | 682 | 0.2% | |
| 126.1019355 | 686 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 227.2328068 | 63 | < 0.1% | |
| 227.214288 | 62 | < 0.1% | |
| 227.1693919 | 63 | < 0.1% | |
| 227.0369359 | 70 | < 0.1% | |
| 227.0184166 | 69 | < 0.1% |
Date
Categorical
| Distinct count | 143 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2011-12-23 | 3027 |
|---|---|
| 2011-11-25 | 3021 |
| 2011-12-16 | 3013 |
| Other values (140) |
| Value | Count | Frequency (%) | |
| 2011-12-23 | 3027 | 0.7% | |
| 2011-11-25 | 3021 | 0.7% | |
| 2011-12-16 | 3013 | 0.7% | |
| 2011-12-09 | 3010 | 0.7% | |
| 2012-02-17 | 3007 | 0.7% | |
| 2011-12-30 | 3003 | 0.7% | |
| 2012-02-10 | 3001 | 0.7% | |
| 2011-12-02 | 2994 | 0.7% | |
| 2012-03-02 | 2990 | 0.7% | |
| 2012-10-12 | 2990 | 0.7% | |
| Other values (133) | 391514 | 92.9% |
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
Dept
Numeric
| Distinct count | 81 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.26031739 |
|---|---|
| Minimum | 1 |
| Maximum | 99 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| Median | 37 |
| Q3 | 74 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range | 56 |
Descriptive statistics
| Standard deviation | 30.49205402 |
|---|---|
| Coef of variation | 0.6889253358 |
| Kurtosis | -1.215570579 |
| Mean | 44.26031739 |
| MAD | 26.53702063 |
| Skewness | 0.3582231935 |
| Sum | 18658822 |
| Variance | 929.7653581 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 1 | 6435 | 1.5% | |
| 10 | 6435 | 1.5% | |
| 38 | 6435 | 1.5% | |
| 21 | 6435 | 1.5% | |
| 67 | 6435 | 1.5% | |
| 16 | 6435 | 1.5% | |
| 14 | 6435 | 1.5% | |
| 13 | 6435 | 1.5% | |
| 79 | 6435 | 1.5% | |
| 81 | 6435 | 1.5% | |
| Other values (71) | 357220 | 84.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 6435 | 1.5% | |
| 2 | 6435 | 1.5% | |
| 3 | 6435 | 1.5% | |
| 4 | 6435 | 1.5% | |
| 5 | 6347 | 1.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 99 | 862 | 0.2% | |
| 98 | 5836 | 1.4% | |
| 97 | 6278 | 1.5% | |
| 96 | 4854 | 1.2% | |
| 95 | 6435 | 1.5% |
Fuel_Price
Numeric
| Distinct count | 892 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.361026527 |
|---|---|
| Minimum | 2.472 |
| Maximum | 4.468 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 2.472 |
|---|---|
| 5-th percentile | 2.653 |
| Q1 | 2.933 |
| Median | 3.452 |
| Q3 | 3.738 |
| 95-th percentile | 4.029 |
| Maximum | 4.468 |
| Range | 1.996 |
| Interquartile range | 0.805 |
Descriptive statistics
| Standard deviation | 0.4585145371 |
|---|---|
| Coef of variation | 0.1364209813 |
| Kurtosis | -1.185404505 |
| Mean | 3.361026527 |
| MAD | 0.4031996485 |
| Skewness | -0.1049014956 |
| Sum | 1416907.953 |
| Variance | 0.2102355808 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 3.638 | 2548 | 0.6% | |
| 3.63 | 2164 | 0.5% | |
| 2.771 | 1917 | 0.5% | |
| 3.891 | 1856 | 0.4% | |
| 3.594 | 1796 | 0.4% | |
| 3.524 | 1793 | 0.4% | |
| 3.523 | 1792 | 0.4% | |
| 2.72 | 1790 | 0.4% | |
| 3.666 | 1778 | 0.4% | |
| 2.78 | 1656 | 0.4% | |
| Other values (882) | 402480 | 95.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 2.472 | 38 | < 0.1% | |
| 2.513 | 45 | < 0.1% | |
| 2.514 | 906 | 0.2% | |
| 2.52 | 39 | < 0.1% | |
| 2.533 | 42 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 4.468 | 368 | 0.1% | |
| 4.449 | 358 | 0.1% | |
| 4.308 | 168 | < 0.1% | |
| 4.301 | 360 | 0.1% | |
| 4.294 | 363 | 0.1% |
IsHoliday
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 29661 |
| Value | Count | Frequency (%) | |
| False | 391909 | 93.0% | |
| True | 29661 | 7.0% |
MarkDown1
Numeric
| Distinct count | 2278 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 64.3% |
| Missing (n) | 270889 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 7246.420196 |
|---|---|
| Minimum | 0.27 |
| Maximum | 88646.76 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.27 |
|---|---|
| 5-th percentile | 149.19 |
| Q1 | 2240.27 |
| Median | 5347.45 |
| Q3 | 9210.9 |
| 95-th percentile | 21801.35 |
| Maximum | 88646.76 |
| Range | 88646.49 |
| Interquartile range | 6970.63 |
Descriptive statistics
| Standard deviation | 8291.221345 |
|---|---|
| Coef of variation | 1.144181695 |
| Kurtosis | 17.60626321 |
| Mean | 7246.420196 |
| MAD | 5262.753849 |
| Skewness | 3.341844686 |
| Sum | 1091897842 |
| Variance | 68744351.4 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 1.5 | 102 | < 0.1% | |
| 460.73 | 102 | < 0.1% | |
| 175.64 | 93 | < 0.1% | |
| 1282.42 | 75 | < 0.1% | |
| 9264.48 | 75 | < 0.1% | |
| 686.24 | 75 | < 0.1% | |
| 5924.71 | 75 | < 0.1% | |
| 1483.17 | 75 | < 0.1% | |
| 3242.59 | 74 | < 0.1% | |
| 10671.71 | 74 | < 0.1% | |
| Other values (2267) | 149861 | 35.5% | |
| (Missing) | 270889 | 64.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.27 | 51 | < 0.1% | |
| 0.5 | 49 | < 0.1% | |
| 1.5 | 102 | < 0.1% | |
| 1.94 | 50 | < 0.1% | |
| 2.12 | 52 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 88646.76 | 68 | < 0.1% | |
| 78124.5 | 70 | < 0.1% | |
| 75149.79 | 73 | < 0.1% | |
| 65021.23 | 73 | < 0.1% | |
| 62567.6 | 66 | < 0.1% |
MarkDown2
Numeric
| Distinct count | 1500 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 73.6% |
| Missing (n) | 310322 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3334.628621 |
|---|---|
| Minimum | -265.76 |
| Maximum | 104519.54 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | -265.76 |
|---|---|
| 5-th percentile | 1.95 |
| Q1 | 41.6 |
| Median | 192 |
| Q3 | 1926.94 |
| 95-th percentile | 16497.47 |
| Maximum | 104519.54 |
| Range | 104785.3 |
| Interquartile range | 1885.34 |
Descriptive statistics
| Standard deviation | 9475.357325 |
|---|---|
| Coef of variation | 2.841503028 |
| Kurtosis | 37.58956105 |
| Mean | 3334.628621 |
| MAD | 4690.43368 |
| Skewness | 5.441261196 |
| Sum | 370970764.8 |
| Variance | 89782396.45 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 1.91 | 539 | 0.1% | |
| 3 | 493 | 0.1% | |
| 0.5 | 485 | 0.1% | |
| 1.5 | 471 | 0.1% | |
| 4 | 367 | 0.1% | |
| 6 | 365 | 0.1% | |
| 7.64 | 354 | 0.1% | |
| 3.82 | 353 | 0.1% | |
| 5.73 | 345 | 0.1% | |
| 19 | 345 | 0.1% | |
| Other values (1489) | 107131 | 25.4% | |
| (Missing) | 310322 | 73.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -265.76 | 71 | < 0.1% | |
| -192 | 72 | < 0.1% | |
| -20 | 72 | < 0.1% | |
| -10.98 | 60 | < 0.1% | |
| -10.5 | 143 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 104519.54 | 72 | < 0.1% | |
| 97740.99 | 73 | < 0.1% | |
| 92523.94 | 73 | < 0.1% | |
| 89121.94 | 74 | < 0.1% | |
| 82881.16 | 73 | < 0.1% |
MarkDown3
Numeric
| Distinct count | 1663 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 67.5% |
| Missing (n) | 284479 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1439.421384 |
|---|---|
| Minimum | -29.1 |
| Maximum | 141630.61 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | -29.1 |
|---|---|
| 5-th percentile | 0.65 |
| Q1 | 5.08 |
| Median | 24.6 |
| Q3 | 103.99 |
| 95-th percentile | 1059.9 |
| Maximum | 141630.61 |
| Range | 141659.71 |
| Interquartile range | 98.91 |
Descriptive statistics
| Standard deviation | 9623.07829 |
|---|---|
| Coef of variation | 6.685379553 |
| Kurtosis | 77.68777203 |
| Mean | 1439.421384 |
| MAD | 2578.055572 |
| Skewness | 8.399453018 |
| Sum | 197331717 |
| Variance | 92603635.78 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 3 | 754 | 0.2% | |
| 6 | 710 | 0.2% | |
| 2 | 660 | 0.2% | |
| 1 | 611 | 0.1% | |
| 0.22 | 487 | 0.1% | |
| 0.5 | 463 | 0.1% | |
| 0.01 | 444 | 0.1% | |
| 4 | 439 | 0.1% | |
| 3.2 | 379 | 0.1% | |
| 1.98 | 363 | 0.1% | |
| Other values (1652) | 131781 | 31.3% | |
| (Missing) | 284479 | 67.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -29.1 | 72 | < 0.1% | |
| -1 | 70 | < 0.1% | |
| -0.87 | 46 | < 0.1% | |
| -0.2 | 69 | < 0.1% | |
| 0 | 67 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 141630.61 | 74 | < 0.1% | |
| 109030.75 | 75 | < 0.1% | |
| 103991.94 | 72 | < 0.1% | |
| 101378.79 | 73 | < 0.1% | |
| 89402.64 | 71 | < 0.1% |
MarkDown4
Numeric
| Distinct count | 1945 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 68.0% |
| Missing (n) | 286603 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3383.168256 |
|---|---|
| Minimum | 0.22 |
| Maximum | 67474.85 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.22 |
|---|---|
| 5-th percentile | 28.76 |
| Q1 | 504.22 |
| Median | 1481.31 |
| Q3 | 3595.04 |
| 95-th percentile | 12645.96 |
| Maximum | 67474.85 |
| Range | 67474.63 |
| Interquartile range | 3090.82 |
Descriptive statistics
| Standard deviation | 6292.384031 |
|---|---|
| Coef of variation | 1.859908688 |
| Kurtosis | 29.99681491 |
| Mean | 3383.168256 |
| MAD | 3329.708762 |
| Skewness | 4.847500037 |
| Sum | 456616070 |
| Variance | 39594096.79 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 9 | 280 | 0.1% | |
| 4 | 200 | < 0.1% | |
| 2 | 197 | < 0.1% | |
| 3 | 146 | < 0.1% | |
| 47 | 143 | < 0.1% | |
| 67.72 | 142 | < 0.1% | |
| 17 | 141 | < 0.1% | |
| 657.56 | 141 | < 0.1% | |
| 8 | 140 | < 0.1% | |
| 1330.36 | 140 | < 0.1% | |
| Other values (1934) | 133297 | 31.6% | |
| (Missing) | 286603 | 68.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.22 | 57 | < 0.1% | |
| 0.41 | 52 | < 0.1% | |
| 0.46 | 48 | < 0.1% | |
| 0.78 | 52 | < 0.1% | |
| 0.87 | 49 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 67474.85 | 72 | < 0.1% | |
| 57817.56 | 74 | < 0.1% | |
| 57815.43 | 68 | < 0.1% | |
| 53603.99 | 72 | < 0.1% | |
| 52739.02 | 72 | < 0.1% |
MarkDown5
Numeric
| Distinct count | 2294 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 64.1% |
| Missing (n) | 270138 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4628.975079 |
|---|---|
| Minimum | 135.16 |
| Maximum | 108519.28 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 135.16 |
|---|---|
| 5-th percentile | 715.52 |
| Q1 | 1878.44 |
| Median | 3359.45 |
| Q3 | 5563.8 |
| 95-th percentile | 11269.24 |
| Maximum | 108519.28 |
| Range | 108384.12 |
| Interquartile range | 3685.36 |
Descriptive statistics
| Standard deviation | 5962.887455 |
|---|---|
| Coef of variation | 1.288165815 |
| Kurtosis | 107.8492655 |
| Mean | 4628.975079 |
| MAD | 2989.7584 |
| Skewness | 8.169909544 |
| Sum | 700974954.2 |
| Variance | 35556026.8 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 2743.18 | 136 | < 0.1% | |
| 1064.56 | 120 | < 0.1% | |
| 9083.54 | 75 | < 0.1% | |
| 20371.02 | 75 | < 0.1% | |
| 3567.03 | 75 | < 0.1% | |
| 4180.29 | 75 | < 0.1% | |
| 3557.67 | 75 | < 0.1% | |
| 986.23 | 74 | < 0.1% | |
| 1773.53 | 74 | < 0.1% | |
| 14660.97 | 74 | < 0.1% | |
| Other values (2283) | 150579 | 35.7% | |
| (Missing) | 270138 | 64.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 135.16 | 65 | < 0.1% | |
| 153.04 | 47 | < 0.1% | |
| 153.9 | 49 | < 0.1% | |
| 164.08 | 52 | < 0.1% | |
| 170.64 | 69 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 108519.28 | 68 | < 0.1% | |
| 105223.11 | 70 | < 0.1% | |
| 85851.87 | 68 | < 0.1% | |
| 63005.58 | 69 | < 0.1% | |
| 58068.14 | 69 | < 0.1% |
Size
Numeric
| Distinct count | 40 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 136727.9157 |
|---|---|
| Minimum | 34875 |
| Maximum | 219622 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 34875 |
|---|---|
| 5-th percentile | 39690 |
| Q1 | 93638 |
| Median | 140167 |
| Q3 | 202505 |
| 95-th percentile | 206302 |
| Maximum | 219622 |
| Range | 184747 |
| Interquartile range | 108867 |
Descriptive statistics
| Standard deviation | 60980.58333 |
|---|---|
| Coef of variation | 0.4459995093 |
| Kurtosis | -1.206345903 |
| Mean | 136727.9157 |
| MAD | 52517.08823 |
| Skewness | -0.3258497665 |
| Sum | 5.764038744e+10 |
| Variance | 3718631543 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 39690 | 20802 | 4.9% | |
| 39910 | 20597 | 4.9% | |
| 203819 | 20376 | 4.8% | |
| 219622 | 10474 | 2.5% | |
| 126512 | 10315 | 2.4% | |
| 205863 | 10272 | 2.4% | |
| 151315 | 10244 | 2.4% | |
| 202307 | 10238 | 2.4% | |
| 204184 | 10225 | 2.4% | |
| 158114 | 10224 | 2.4% | |
| Other values (30) | 287803 | 68.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 34875 | 8999 | 2.1% | |
| 37392 | 9036 | 2.1% | |
| 39690 | 20802 | 4.9% | |
| 39910 | 20597 | 4.9% | |
| 41062 | 6751 | 1.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 219622 | 10474 | 2.5% | |
| 207499 | 10062 | 2.4% | |
| 206302 | 10113 | 2.4% | |
| 205863 | 10272 | 2.4% | |
| 204184 | 10225 | 2.4% |
Store
Numeric
| Distinct count | 45 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 22.20054558 |
|---|---|
| Minimum | 1 |
| Maximum | 45 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| Median | 22 |
| Q3 | 33 |
| 95-th percentile | 43 |
| Maximum | 45 |
| Range | 44 |
| Interquartile range | 22 |
Descriptive statistics
| Standard deviation | 12.78529739 |
|---|---|
| Coef of variation | 0.5759001437 |
| Kurtosis | -1.146502781 |
| Mean | 22.20054558 |
| MAD | 10.99608175 |
| Skewness | 0.07776250175 |
| Sum | 9359084 |
| Variance | 163.4638293 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 13 | 10474 | 2.5% | |
| 10 | 10315 | 2.4% | |
| 4 | 10272 | 2.4% | |
| 1 | 10244 | 2.4% | |
| 2 | 10238 | 2.4% | |
| 24 | 10228 | 2.4% | |
| 27 | 10225 | 2.4% | |
| 34 | 10224 | 2.4% | |
| 20 | 10214 | 2.4% | |
| 6 | 10211 | 2.4% | |
| Other values (35) | 318925 | 75.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 10244 | 2.4% | |
| 2 | 10238 | 2.4% | |
| 3 | 9036 | 2.1% | |
| 4 | 10272 | 2.4% | |
| 5 | 8999 | 2.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 45 | 9637 | 2.3% | |
| 44 | 7169 | 1.7% | |
| 43 | 6751 | 1.6% | |
| 42 | 6953 | 1.6% | |
| 41 | 10088 | 2.4% |
Temperature
Numeric
| Distinct count | 3528 |
|---|---|
| Unique (%) | 0.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 60.09005873 |
|---|---|
| Minimum | -2.06 |
| Maximum | 100.14 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | -2.06 |
|---|---|
| 5-th percentile | 27.31 |
| Q1 | 46.68 |
| Median | 62.09 |
| Q3 | 74.28 |
| 95-th percentile | 87.27 |
| Maximum | 100.14 |
| Range | 102.2 |
| Interquartile range | 27.6 |
Descriptive statistics
| Standard deviation | 18.44793115 |
|---|---|
| Coef of variation | 0.3070047115 |
| Kurtosis | -0.6359219778 |
| Mean | 60.09005873 |
| MAD | 15.37733082 |
| Skewness | -0.321404152 |
| Sum | 25332166.06 |
| Variance | 340.3261636 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 50.43 | 709 | 0.2% | |
| 67.87 | 646 | 0.2% | |
| 72.62 | 594 | 0.1% | |
| 76.67 | 583 | 0.1% | |
| 70.28 | 563 | 0.1% | |
| 76.03 | 555 | 0.1% | |
| 50.56 | 544 | 0.1% | |
| 64.05 | 542 | 0.1% | |
| 64.21 | 519 | 0.1% | |
| 50.81 | 487 | 0.1% | |
| Other values (3518) | 415828 | 98.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -2.06 | 69 | < 0.1% | |
| 5.54 | 68 | < 0.1% | |
| 6.23 | 69 | < 0.1% | |
| 7.46 | 69 | < 0.1% | |
| 9.51 | 70 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100.14 | 44 | < 0.1% | |
| 100.07 | 46 | < 0.1% | |
| 99.66 | 48 | < 0.1% | |
| 99.22 | 185 | < 0.1% | |
| 99.2 | 46 | < 0.1% |
Type
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| A | |
|---|---|
| B | |
| C | 42597 |
| Value | Count | Frequency (%) | |
| A | 215478 | 51.1% | |
| B | 163495 | 38.8% | |
| C | 42597 | 10.1% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
Unemployment
Numeric
| Distinct count | 349 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 7.960288695 |
|---|---|
| Minimum | 3.879 |
| Maximum | 14.313 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 3.879 |
|---|---|
| 5-th percentile | 5.326 |
| Q1 | 6.891 |
| Median | 7.866 |
| Q3 | 8.572 |
| 95-th percentile | 12.187 |
| Maximum | 14.313 |
| Range | 10.434 |
| Interquartile range | 1.681 |
Descriptive statistics
| Standard deviation | 1.863296038 |
|---|---|
| Coef of variation | 0.2340739275 |
| Kurtosis | 2.73121663 |
| Mean | 7.960288695 |
| MAD | 1.283045348 |
| Skewness | 1.183742568 |
| Sum | 3355818.905 |
| Variance | 3.471872127 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 8.099 | 5152 | 1.2% | |
| 8.163 | 3636 | 0.9% | |
| 7.852 | 3614 | 0.9% | |
| 7.343 | 3416 | 0.8% | |
| 7.057 | 3414 | 0.8% | |
| 7.931 | 3400 | 0.8% | |
| 7.441 | 3397 | 0.8% | |
| 6.565 | 3370 | 0.8% | |
| 8.2 | 3361 | 0.8% | |
| 6.891 | 3360 | 0.8% | |
| Other values (339) | 385450 | 91.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 3.879 | 287 | 0.1% | |
| 4.077 | 938 | 0.2% | |
| 4.125 | 1831 | 0.4% | |
| 4.145 | 562 | 0.1% | |
| 4.156 | 1815 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 14.313 | 2636 | 0.6% | |
| 14.18 | 2423 | 0.6% | |
| 14.099 | 2441 | 0.6% | |
| 14.021 | 2263 | 0.5% | |
| 13.975 | 1529 | 0.4% |
Weekly_Sales
Numeric
| Distinct count | 359464 |
|---|---|
| Unique (%) | 85.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 15981.25812 |
|---|---|
| Minimum | -4988.94 |
| Maximum | 693099.36 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | -4988.94 |
|---|---|
| 5-th percentile | 59.9745 |
| Q1 | 2079.65 |
| Median | 7612.03 |
| Q3 | 20205.8525 |
| 95-th percentile | 61201.951 |
| Maximum | 693099.36 |
| Range | 698088.3 |
| Interquartile range | 18126.2025 |
Descriptive statistics
| Standard deviation | 22711.18352 |
|---|---|
| Coef of variation | 1.421113616 |
| Kurtosis | 21.49128991 |
| Mean | 15981.25812 |
| MAD | 15161.44355 |
| Skewness | 3.262008185 |
| Sum | 6737218987 |
| Variance | 515797856.8 |
| Memory size | 6.4 MiB |
| Value | Count | Frequency (%) | |
| 10 | 353 | 0.1% | |
| 5 | 289 | 0.1% | |
| 20 | 232 | 0.1% | |
| 15 | 215 | 0.1% | |
| 12 | 175 | < 0.1% | |
| 1 | 169 | < 0.1% | |
| 10.47 | 167 | < 0.1% | |
| 11.97 | 154 | < 0.1% | |
| 2 | 148 | < 0.1% | |
| 7 | 146 | < 0.1% | |
| Other values (359454) | 419522 | 99.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -4988.94 | 1 | < 0.1% | |
| -3924 | 1 | < 0.1% | |
| -1750 | 1 | < 0.1% | |
| -1699 | 1 | < 0.1% | |
| -1321.48 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 693099.36 | 1 | < 0.1% | |
| 649770.18 | 1 | < 0.1% | |
| 630999.19 | 1 | < 0.1% | |
| 627962.93 | 1 | < 0.1% | |
| 474330.1 | 1 | < 0.1% |
First rows
| CPI | Date | Dept | Fuel_Price | IsHoliday | MarkDown1 | MarkDown2 | MarkDown3 | MarkDown4 | MarkDown5 | Size | Store | Temperature | Type | Unemployment | Weekly_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 211.096358 | 2010-02-05 | 1 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 24924.50 |
| 1 | 211.096358 | 2010-02-05 | 2 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 50605.27 |
| 2 | 211.096358 | 2010-02-05 | 3 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 13740.12 |
| 3 | 211.096358 | 2010-02-05 | 4 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 39954.04 |
| 4 | 211.096358 | 2010-02-05 | 5 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 32229.38 |
| 5 | 211.096358 | 2010-02-05 | 6 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 5749.03 |
| 6 | 211.096358 | 2010-02-05 | 7 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 21084.08 |
| 7 | 211.096358 | 2010-02-05 | 8 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 40129.01 |
| 8 | 211.096358 | 2010-02-05 | 9 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 16930.99 |
| 9 | 211.096358 | 2010-02-05 | 10 | 2.572 | False | NaN | NaN | NaN | NaN | NaN | 151315 | 1 | 42.31 | A | 8.106 | 30721.50 |
Last rows
| CPI | Date | Dept | Fuel_Price | IsHoliday | MarkDown1 | MarkDown2 | MarkDown3 | MarkDown4 | MarkDown5 | Size | Store | Temperature | Type | Unemployment | Weekly_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 421560 | 192.308899 | 2012-10-26 | 85 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 1689.10 |
| 421561 | 192.308899 | 2012-10-26 | 87 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 8187.66 |
| 421562 | 192.308899 | 2012-10-26 | 90 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 25352.32 |
| 421563 | 192.308899 | 2012-10-26 | 91 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 16330.84 |
| 421564 | 192.308899 | 2012-10-26 | 92 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 54608.75 |
| 421565 | 192.308899 | 2012-10-26 | 93 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 2487.80 |
| 421566 | 192.308899 | 2012-10-26 | 94 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 5203.31 |
| 421567 | 192.308899 | 2012-10-26 | 95 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 56017.47 |
| 421568 | 192.308899 | 2012-10-26 | 97 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 6817.48 |
| 421569 | 192.308899 | 2012-10-26 | 98 | 3.882 | False | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 118221 | 45 | 58.85 | B | 8.667 | 1076.80 |